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Abstract 

We review a statistical-geometrical and a generalized entropic approach to the uncer- 
tainty principle. Both approaches provide a strengthening and generalization of the standard 
Heisenberg uncertainty relations, but in different directions. 


1 Introduction 

The purpose of this note is to introduce two approaches to the uncertainty principle which have 
been developed recently, a statistical-geometrical approach and a generalized entropic approach. 
But before we go into this, let us consider why one would need a new approach at all. In other 
words, what is unsatisfactory with the traditional approach to the uncertainty principle? In the 
standard textbook approach the uncertainty principle for position and momentum is expressed by 
the inequality 

: ^PA^Q > ^ ( 1 ) 

or more generally: 

W: (A*/t) 2 (A*B) 2 > j(l([AB]-)*| 2 -K[/l^{/l)*.B-<fl)J + ) 2 ) (2) 

for arbitrary observables A and B. Here, A etc. is defined as: 

(A^) 2 = ((A-(A)^)% (3) 

There are three problems. First, uncertainty relations as (1) or (2) presuppose that all observ- 

ables for which one wants to write down an uncertainty relation can be represented as self-adjoint 
(or at least normal [1]) operators. Unfortunately this is not always the case. Notorious examples 
are time and energy, and phase and photon number. Further, in relativistic quantum theory, even 
the status of the position observable becomes dubious. There is no self-adjoint position (vector) 
operator for photons [2]. 

Secondly, note that the right-hand side of (2) still depends on if. It may become zero, even if 
A and B do not commute. In fact, this always happens in an eigenstate of A or B. Then, taken 
as a general statement about A^.AA^B , the inequality only says that this product is greater than 
zero for some states and equal to zero for others. That, however, is true also in classical physics. 
To read off more from (2), one needs to know the state. But then, when if is given, one can also 
calculate A^A and A^B directly, without using the inequality at all. 
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Even in the case where the right-hand side is always strictly greater than zero, as in relation 
(1), there are further problems, due to the properties of the standard deviations. In the definition 
(3) of the standard deviation, the probability density is integrated with a quadratic weight factor 
that puts most emphasis on the tails of the distribution. As a result, the standard deviation can 
become very high, even if 99% or more of the probability distribution is concentrated in a very 
small interval, and the remainder is located in long tails, as e.g. in a Breit-Wigner lineshape. 
Thus a large standard deviation does not. necessarily prevent a probability distribution from being 
very sharply concentrated, and a bound on the product of standard deviations by itself does not 
prevent both observables from being as precisely determined as we please. In the next sections we 
ask whether there are more stringent inequalities that improve on the above aspects. 

2 Statistical-geometrical approach 

It is usual to assume in quantum theory that the state of the system is given. But in this section 
we consider an inverse problem. Suppose we don't know the state of the system. Our problem is 
to make a statistical inference about this state from given measurement results. 

For definiteness, let us assume that some partial information about the state is given: it belongs 
to a given set of (pure) states labeled by an index parameter 0. To be more specific, it is asumed 
that these states are generated by some unitary group: 

|V’*> = e idA ' h \tl>) (4) 

where A is a self-adjoint operator. We can think of this set of states as describing a curve in state 
space. The problem of statistical inference is now equivalent to that of estimating the value of 0 . 

It is clear that a detailed discussion of this estimation problem should involve the kind of mea- 
surements performed, the results obtained, and criteria distinguishing "good’ from 'bad’ estimates. 
However, even without going into details of statistical theory [3, 4], it can be made plausible that a 
tundamental bound for the estimation accuracy is obtained by considering the overlap \(4'e\' l l ) 0+66)\- 
If this overlap is high the states resemble each other much and a typical measurement result which 
would be probable or improbable in one state would likewise be probable or improbable in the 
other. Then one cannot expect to discriminate the states by any measurement procedure. It is 
only when the overlap begins to fall off that there are observables whose probability distributions 
for the states 4-e and ipo+se differ enough to allow for accurate discrimination. 

This suggests the following definition. Choose some fixed value ft < 1 and define the estimation 
inaccuracy 8^0 as the smallest value of 80 for which 

{{i'oW’o+so}] = ft 

Due to the particular choice (4), this overlap does not depend on the value of 0. One can then 
show: [5, 6. 7. 8] 

V0 : 8^0 A > 2 h arccos ft (5) 

This then represents a useful uncertainty relation. It says that an unlimited increase in the 
estimation accuracy of 0 is only possible at the expense of an increased spread in A. 

Several remarks are called for. First, these relations are applicable to any one-parameter uni- 
tary group. Obvious examples are the translations in time, represented by the evolution operators 
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U(t) = exp(— itH/h) or the translations in space, exp (ixP/Ti), where P is the momentum opera- 
tor; or we can consider an angle of rotation and angular momentum, or phaseshifts and photon 
number. In short, we find statistical uncertainty relations of the type (5) in every case where A 
is the generator of a unitary group and 0 the group parameter. This approach is in fact ideally 
suited for a relativistical treatment of quantum theory in which one starts from the construction 
of unitary groups from the symmetries of the system. 

Secondly, relation (5) is asymmetrical; 8^6 is an inaccuracy of estimation of a parameter (i.e. 
a c-number). on the other hand is the r.m.s. spread of a quantum mechanical observable 

(self-adjoint operator). Since one does not need a pair of operators to obtain relation (5) there 
are no problems when such a pair does not exist. 

However if one does exist, e.g. in the case of non-relativistic position and momentum, it is 
possible to take advantage of that fact. Then there is a second, independent, uncertainty relation 
for the spread in position and the estimation accuracy of the parameter in the group of kicks 
{U(p) = exp(— ipQ/h)}, i.e. shifts in momentum. This restores symmetry between position and 
momentum. More importantly, we note that the position operator mimicks the parameter x of 
the translation group in the sense that 


( 0*1 QW'x) = X ( 6 ) 

(assuming (V'lQIV’l) = 0) be. the position operator acts as an unbiassed estimator of the location 
parameter. From this it follows: [9] 

,<^4+ Of 

combined with (5), where A is interpreted as momentum, this result implies the standard uncer- 
tainty relation (1). In fact, as a bonus, we obtain (1) not only for the position operator proper, 
but for any other operator acting as unbiassed estimator of x as well. 

There is only one problem of those mentioned in the previous section that is not solved by the 
relations (5): they still rely on one standard deviation, and thus become useless for states in which 
this diverges. To fix this problem, the standard deviation can be replaced by an interquantile range, 
i.e. the smallest size W^,(A) of an interval W on which a fraction a < 1 of the total probability 
distribution for A is concentrated: J w \(ip\a)\ 2 da = a. A variation of the proof of (5) gives [6, 8] 

S^6W^{A) > Tiarccos — ' — if jd > 2a — 1 (7) 

a 

Finally we note that this concept of estimation inaccuracy fits into a general geometrical 
approach to statistical inference on the basis of the Fisher information metric [4], Let it suffice 
here to note that this metric equips Hilbert space with a. statistical distance between states which 
equals 

d(rp,i{>') = arccos \{y\4>')\ 

and that the geometrical background of (5) is the simple fact that the distance between xl'g and 
(i.e. the right hand side of (5)) is less than the length of the curve (4) connecting these points. 
This also point the way to how the relations are to be generalized in cases where the curve is not 
generated by a unitary group. 
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3 Generalized entropic approach 

For a discrete probability distribution p — (pi, . . . p„), with p, > 0, YliPi — 1) the Shannon entropy 

Hip) := -^2 Pi log pi ( 8 ) 

* 

represents, roughly speaking, a measure of whether the distribution is ‘spread out’ or ‘peaked’. If 
A and B are quantum observables for which an uncertainty principle holds, it is natural to ask 
for a lower bound of the sum of the Shannon entropies for the probabilities |{'i/>|a,)| 2 and (ij>\bj}\ 2 
[10, 11]. Here we assume a discrete spectrum and | a,) and \bj) denote the eigenstates of A and B. 
It turns out that 

H(ip, A) + H(xJ>, B) := H{\(xp\a,)\ 2 ) + H( |<0|6 J )| 2 ) > -2 log sup |(a,|6j)| (9) 

This entropic uncertainty relation limits the concentration of both probability distributions by a 
bound which is independent of 

There is a class of expressions that share many properties with the Shannon entropy, and also 
represent useful measures of ‘peakedness’ or concentration: 

Mr{p) = (X>l +r ) 1/r ( 10 ) 

t 

Their properties have been studied extensively by Hardy, Littlewood and Polya and by Renyi [12]. 
( — log M t is known as Renyi entropy.) Special cases are: 

Moo(p) = supp, , M 0 (p) = e~ HM , M-i{p) = (#{z : p t > 0})~ x (11) 

* 

Where # counts the number of elements in a. set. The generalized entropic uncertainty relation 
then reads: [11, 13] 

M r (|(V>|a I )| 2 )M s (|(V>|6 J )| 2 ) < sup |{a,|6 ; )| 2 for r = -s/(2s + 1) r,s>- 1/2 (12) 

u 

which contains the relations (9) as the special case with r — s = 0. 

Remarks: The above inequalities apply to any pair of discrete observables and yield a non- 
trivial bound iff these observables do not share an eigenstate. (A condition which is slightly 
stronger than mere non-commutativity.) In the case of a two-dimensional Hilbert space, the most 
restrictive bounds are obtained by the choice r = — 1/2, s = oo or v.v. 

Secondly, in the proof of (12) it is not necessary to assume that observables are represented by 
self-adjoint operators. It is sufficient to demand the existence of the sets of “eigenstates” {|a,)} 
and {!&,)}, possibly non-orthogonal, such that 

X>,}<«. \ = n, E\W = 1[ (13) 

i i 

Thus, the approach of this section is also applicable if one accepts unitary operators [14] or even 
more generally, positive-operator- valued measures (POVM’s) [15] as bona fide representations of 
observables. As an example, we mention a phase observable below. 
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In the case of continuous observables, it appears necessary to replace (8) and (10) by a relative 
notion of entropy: 

M r {p\p) := (/(^) 1+r ^) > H -(P\l*) = ~^gM 0 {p\p) = ( 14 ) 

where dp/ dp is the Radon-Nikodym derivative of a probability measure p with respect to a ‘back- 
ground measure’ p. These expressions reflect whether the probability distribution p is concentrated 
in comparison with p. In the discrete case, the absolute entropies are recovered by choosing p to 
be the counting measure #. In the case of continuous observables it is natural to take for p the 
Lebesgue measure, and dp/ dp becomes an ordinary probability density. 

For continuous nondegenerate operators A and B a theorem of Haussdorf and Young analogous 
to (12) leads to [17, 13] 

M r (<MWV,(«’,Q,W:= (/ |Ma)| 2(1+ 'W) ,/r (/ |(V-|(>>| : " ,+ * ) «'(-)‘ / '<siplW6)| 2 

(Still assuming r = -s/(2s + 1).) For position and momentum a slightly stronger inequality 
M r (ip,P\p)M s (xp,Q\p ) > 2(1 + r)(l + 2r) -(1+2r)/(2r) (27rh) -1 is obtained by a theorem of Beckner. 
These inequalities are all sharp for Gaussian ‘minimum uncertainty’ states and strictly imply the 
standard uncertainty relations (1). 

Let N be the photon number operator of a single electromagnetic field mode, with eigen- 
states |n) and £|n)(n| = S. A description of a phase observable by means of a POVM $ was 
constructed by Levy-Leblond and by Susskind and Glogower [14, 16] from the non-orthogonal 
improper “eigenstates” 

W=(2rt)- l/! E^"W 

n 

This yield a resolution of identity / 0 2,r d<f>\<j>) {</>\ - 1 in analogy with (13). Here one finds an 
analogous uncertainty relation 

M r (V>, $)M 4 (t/>, N) < (15) 

where , , 

i , \ l/r /co \ l ' s 

M r (i I/,,*) = (/ |<^|^>r (1+,) ^) , M s (0,yv) = ^El{^l«)l 2(1+s) J 

Let us finally compare the two different approaches. Both improve on the standard approach in 
the sense that they yield bounds which are state independent and strictly imply A^P A^Q > %/ 2. 
The statistical/geo metrical approach is restricted to conjugate pairs of quantities: time/energy, 
location/ momentum, angle/angular momentum etc. All such pairs are treated, on the same 
footing, as consisting of a parameter and an observable. It is essential that the observable is 
represented as a self-adjoint operator because of its role in (4). The approach is relativistically 
invariant. The entropic approach, on the other hand, is applicable to pairs of arbitrary observables, 
not necessarily conjugate pairs or even self-adjoint operators. It is sufficient that they do not share 
eigenstates. However, these uncertainty relations do not seem to have a relativistic generalization. 
Also, the last problem mentioned in the introduction is not completely overcome in this approach. 
IN the example of the Breit-Wigner distribution one still has the result M r (p\p) — 0 for r < 0 
and the inequality is not very restrictive. 
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